Skip to content

Causality Discovery

Introduction to Causality

Definition

  • Causality is the influence one event (cause) has on another (effect).
  • It implies that changes in the cause lead to changes in the effect, forming a non-random link.

Key Characteristics of Causality

  • Directionality: A causes B, but B does not necessarily cause A.
  • Mechanism: Changes in the cause generate changes in the effect.
  • Counterfactuals: Considers what would happen to the effect if the cause did not occur.

Causality Discovery

Methods

  • Experiment-based approach
    • Control experiment: Intervention causes changes in outcomes,
    • In many cases too expensive, too time-consuming, or even impossible.
  • Data-based approach

Overview of Data-based Causal Discovery Methods

Methods

  • Constraint-based methods
    • PC
    • FCI
  • Score-based methods
    • GES (Greedy Equivalence Search)
  • Functional Causal Models
    • Linear, Non-Gaussian Model
    • Non-linear Methods
  • Hybrid Methods
    • SELF (Structural Equational Likelihood Framework)
    • FRITL (Functional Representation with Independent Triad and Likelihood)

Constraint-Based Methods

Assumptions

  • Causal Markov Assumption: A variable X is independent of every other variable (except X's effects) conditional on all of its direct causes.
  • Causal Faithfulness Assumption: For all observed variables, Xi is independent of Xj conditional on variables Z if and only if the Markov Assumption for G entails such conditional independencies.

GCM

Limitations

  • DAGs within the same Markov Equivalence Class cannot be distinguished solely based on conditional independence relationships.

CausalInference

Constraint-Based Method: PC Algorithm

  1. Initialize Graph: Start with a fully connected undirected graph.
  2. Edge Removal: Test conditional independence for each pair of variables given subsets of other variables. Remove edges where conditional independence is found.
  3. Identify Colliders: Orient edges for v-structures (XZY) where X and Y are independent unless conditioned on Z.
  4. Orient Remaining Edges: Use orientation rules to direct undetermined edges.
  5. Output CPDAG: The result is a CPDAG representing the Markov Equivalence Class.

PC Algorithm Example

PC-Example

PC Algorithm Limitation

  • Limitations: Unable to deal with latent confounders.

Constraint-Based Method: FCI Algorithm Process

  1. Initialize Graph: Start with a fully connected undirected graph over all observed variables.
  2. Edge Removal: Test conditional independence between each pair of variables given subsets of other variables.
  3. Identify Colliders: Identify v-structures (XZY).
  4. Propagate Edge Orientations: Apply orientation rules to propagate edge directions.
  5. Handle Ambiguous Relationships: Determine possible orientations considering latent variables.
  6. Output PAG: The result is a Partial Ancestral Graph (PAG).

FCI-Example

Functional Causal Models (FCMs)

Assumptions

  1. Independent noise assumption: Independence between the causes X and noises E.
  2. Independent mechanism assumption: Independence between the causes X and process f.

IN

Independent Noise (IN) Condition

  • Causal Asymmetry in the Linear non-Gaussian Case: Y=αX+E, where XY.

FCM

Functional-Based Methods: LiNGAM

LiNGAM Model

  • LiNGAM can be expressed as:X=BX+E
  • Assumptions:
    • X: observed variables.
    • B: connection weights.
    • E: non-Gaussian noise vector.

LiNGAM: Analysis by ICA

ICA

LiNGAM Example

[E1E3E2]=[1000.5100.20.31][X2X3X1]{X2=E1X3=0.5X2+E3X1=0.2X2+0.3X3+E2

Functional-Based Methods: PNL (post-NonLinear method)

  • PNL Model:

    vj=f2(f1(vi)+nj)
    • vi and nj are independent.
    • f1 is a non-constant smooth function.
    • f2 is a reversible smooth function.

PNL

Hybrid Methods

  • Hybrid Approach: Combines constraint-based and functional approaches.
  • Examples:
    • SELF (Structural Equational Likelihood Framework)
    • FRITL (Functional Representation with Independent Triad and Likelihood)

Comparison of Methods

PCFCIGESLiNGAM/PNL/ANMSELFFRITL
Faithfulness assumption required?YesYesSome weaker condition required (not totally clear yet)NoNoNo
Specific assumptions on data distributions required?NoNoYes (usually assumes linear-Gaussian models or multinomial distributions)YesYesYes
Properly handle confounders?NoYesNoNoNoYes
OutputMarkov equivalence classPartial ancestral graphMarkov equivalence classDAG as well as causal model (under the respective identifiability conditions)DAG with likelihood-based causal structure (assumes observed variables)DAG or PAG, refined with ICA and Triad condition for latent confounders

Thank You for Listening!